MiniCluster UI Report

MiniCluster HTML Report

Throughput

127.8000

Final Loss

0.3348

Total Time (s)

18.3000

Power (W)

286.5000

Run Metadata

Toolminicluster 0.1.0
Timestamp2026-02-21T02:58:27.987985
HardwareCPU
Frameworkpytorch 2.10.0+cpu
Workloadcluster_health_demo_run (training)

Configuration

num_processes1.0000
num_steps10.0000
batch_size16.0000
learning_rate0.0100
hidden_size128.0000
num_layers2.0000
collective_backendnccl
workloadtransformer
seed42.0000
tdp_watts150.0000
loss_tolerance0.0100
regression_threshold5.0000

Metrics

Throughput (samples/sec)127.8000
Final Loss0.3348
Total Time (s)18.3000
P50 All-Reduce (ms)2.6200
P95 All-Reduce (ms)3.7100
P99 All-Reduce (ms)4.2400
Max All-Reduce (ms)4.4300
All-Reduce StdDev (ms)0.3900
Power (W)286.5000
Performance/Watt0.4460
Energy/Step (J)2.2400
Temperature (C)69.8000
Communication Overhead (%)N/A
Scaling Efficiency (%)92.4000

Training Graphs

Loss by Step

Throughput by Step

Per-Step Timing (Compute + AllReduce)